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COUNTER- BASED PHASE SHIFTER CIRCUITS AND METHODS WITH 
OPTIONAL DUTY CYCLE CORRECTION 

FIELD OF THE INVENTION 

[0001] The invention relates to clock phase shifter 
circuits. More particularly, the invention relates to 
counter-based phase shifter circuits and methods optionally 
having duty cycle correction and maximum value capabilities. 

BACKGROUND OF THE INVENTION 

[0002] Clock signals are used in virtually every 
integrated circuit (IC) and electronic system to control 
timing. For example, every time a rising edge occurs on a 
clock signal, all the flip-flops in a circuit might change 
state. Therefore, clock signals are often distributed widely 
throughout an IC or system. When a clock signal is widely 
distributed, inherent delays often cause various portions of 
the IC or system to receive the clock signal at different 
times. Further, setup and hold requirements can vary between 
different destination circuits. For these and other reasons, 
it is often desirable to perform a phase adjustment on a 
clock signal. 

[0003] Phase adjustments are typically performed using a 
phase-lock loop (PLL) or delay-lock loop (DLL) circuit. 
However, PLLs are analog in nature and take a long time to 
simulate, and a design that works in one manufacturing 
process may stop working when manufactured using another 
process. Therefore, PLLs are difficult to design, and often 
are not feasible in a given circuit or system. DLLs can also 
be complicated and difficult to design. Additionally, DLLs 
typically consume a great deal of silicon area. Therefore, 
clock phase adjustment is often not feasible using known 
circuits and methods. 

[0004] Therefore, it is desirable to provide circuits and : 
methods that enable a circuit designer to adjust the phase of 
an input clock signal without using a PLL or DLL, using a 
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fairly simple circuit that consumes a relatively small amount 
of silicon area. Preferably, such circuits and methods can 
optionally be implemented using the logic resources included 
in a programmable logic device ( PLD) . 

SUMMARY OF THE INVENTION 

[0005] The invention provides phase shifter circuits and 
methods that use counters to determine the positions of the 
output clock edges. A phase shifter circuit accepts an input 
clock signal and provides an output clock signal having a 
phase delay (a predetermined offset) from the input clock 
signal, e.g., delayed by a specified amount. A phase shifter 
circuit according to an embodiment of the invention includes : 
a plurality of counter circuits, each clocked by a count 
clock relatively much faster than the input clock signal. In 
some embodiments, each counter includes a small oscillator 
circuit implemented in the same fashion and generating a 
count clock for the counter. Thus, each counter uses a count 
clock having the same clock frequency. 

[0006] A first counter is periodically enabled to count 
for one input clock period, and the counted value (or a value 
derived from the counted value) is stored in a register. The 
stored value can be, for example, the counted value, half the 
counted value, the counted value minus a decrement value, and 
so forth. The stored value represents a maximum delay, i.e., 
a maximum permissible number of counts for the phase delay to 
be applied to the input signal. 

[0007] The applied delay between the input clock signal 
and the output clock signal (the "delay value") is either a 
set delay provided to the circuit (if no more than the 
maximum delay) or the maximum delay (if the set delay exceeds 
the maximum delay) . The delay value is provided to a second 
counter that counts (for example) from zero to the delay 
value. Thus, the second counter generates a first pulse 
partway through the input clock period, i.e., one delay value 
after an initial (e.g., rising) edge on the input clock 
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signal. The first pulse causes a first edge on the output 
clock signal. (The term "initial edge" is used herein to 
signify an edge on the input clock signal that is considered 
the starting point for the input clock period. The term 
"following edge" is used to signify the opposite edge from 
the initial edge.) A third counter running at the same clock 
rate then generates a second pulse at another time in the 
input clock cycle, as desired. This second pulse causes a 
second edge on the output clock signal. 
[0008] In some embodiments, the third counter starts 
counting in response to a following (e.g., falling) edge on 
the input clock signal, and stops counting after one delay 
value. Thus, the pulse width of the output clock signal is 
the same as the pulse width of the input clock signal. 
[0009] In other embodiments, the third counter starts 
counting in response to the pulse from the second counter 
(i.e., one delay value after the initial (e.g., rising) edge 
on the input clock signal) and stops counting after a delay 
of one-half of the input clock period. The length of one- 
half input clock period is derived from the number counted by 
the first counter. In these embodiments, the output clock 
signal has a 50 percent duty cycle. Therefore, these 
embodiments provide duty cycle correction (DCC) . 
[0010] In yet other embodiments, the duty cycle correction 
feature provides a different duty cycle correction (e.g., 
25%) , or can be enabled or disabled as desired. 
[0011] Other embodiments of the invention provide related 
structures and methods of providing a phased output clock 
signal from an input clock signal. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] The present invention is illustrated by way of 
example, and not by way of limitation, in the following 
figures . 

[0013] Fig. 1 illustrates a prior art phase shifter 
circuit that includes a delay lock loop (DLL) . 
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[0014] Fig. 2 illustrates a phase shifter circuit that 
allows a phase shift of up to one-half of an input clock 
period and does not provide duty cycle correction (DCC) . 
[0015] Fig. 2A illustrates a first output clock generator 
that can be used in the embodiment of Fig. 2. 

[0016] Fig. 2B illustrates a second output clock generator 
that can be used in the embodiment of Fig. 2. 
[0017] Fig. 3 is a timing diagram for the phase shifter 
circuit of Fig. 2. 

[0018] Fig. 4 is a schematic diagram of a counter circuit 
that can optionally be used to implement the second and third 
counter circuits in the embodiment of Fig. 2. 
[0019] Fig. 5 illustrates a phase shifter circuit that 
allows a phase shift of up to one-half of an input clock 
period and provides 50 percent DCC. 

[0020] Fig. 6 is a timing diagram for the phase shifter 
circuit of Fig. 5. 

[0021] Fig. 7 illustrates a phase shifter circuit that 
allows a phase shift of up to one-half of an input clock 
period and optionally provides or does not provide 50 percent 
DCC. 

[0022] Fig. 8 illustrates a phase shifter circuit that 
allows a phase shift of up to one-fourth of an input clock 
period and optionally provides or does not provide 25 percent 
DCC. 

[0023] Fig. 9 illustrates a phase shifter circuit that 
allows a phase shift of up to three- fourths of an input clock 
period and does not provide DCC. 

[0024] Fig. 10 illustrates a phase shifter circuit that 
allows a phase shift of up to three-fourths of an input clock 
period and provides 50 percent DCC. 

[0025] Fig. 11 illustrates a phase shifter circuit that 
allows a phase shift of up to three- fourths of an input clock 
period and optionally provides or does not provide 50 percent 
DCC. 
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[0026] Fig. 12 illustrates a phase shifter circuit that 
allows a phase shift of up one input clock period minus a 
predetermined decrement value, and does not provide DCC. 
[0027] Fig. 13 illustrates a phase shifter circuit that 
allows a phase shift of up one input clock period minus a 
predetermined decrement value, and provides 50 percent DCC. 
[0028] Fig. 14 illustrates a phase shifter circuit that 
allows a phase shift of up one input clock period minus a 
predetermined decrement value, and optionally provides or 
does not provide 50 percent DCC. 

[0029] Fig. 15 illustrates the steps of a first method of 
providing a phased output clock signal from an input clock 
signal, according to an embodiment of the present invention. 
[0030] Fig. 16 illustrates the steps of a second method of 
providing a phased output clock signal from an input clock 
signal, according to another embodiment of the present 
invention. 

DETAILED DESCRIPTION OF THE DRAWINGS 

[0031] In the following description, numerous specific 
details are set forth to provide a more thorough 
understanding of the present invention. However, it will be 
apparent to one skilled in the art that the present invention 
can be practiced without these specific details. 
[0032] Fig. 1 illustrates a known phase shifter circuit 
that includes a DLL and provides duty cycle correction (DCC) . 
The phase shifter circuit of Fig. 1 includes a duty cycle 
correction (DCC) portion 100, a buffer and clock tree 13 0, a 
match element 12 0 that matches the delay and loading of the 
buffer and clock tree 13 0, and a phase shift portion 140. 
[0033] The DCC portion accepts an input clock signal P0 
from the phase shift portion and provides a DCC-corrected 
output signal 50DCC to the buffer and clock tree 130. Four 
delay chains 101-104 each provide 128 tap signals to an 
associated multiplexer 105-108, respectively. Multiplexers 
105-108 are controlled by select signals SELECTl to provide 
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selected signals P25, P50, P75, and PlOO, respectively. 
Signal P100 is aligned with input signal PO . Signal P25 is 
offset by 90 degrees, signal P50 is offset by 180 degrees, 
and signal P75 is offset by 270 degrees from input signal PO . 
[0034] Signals P0, P25, P50, P75, and P100 are provided to 
clock generator 111, which generates the DCC-corrected output 
signal 50DCC to the buffer and clock tree 130. Signal P100 
is also provided to phase detector 1 (109), which compares 
signal P100 with input signal P0 . Phase detector 1 (109) 
provides control signals CTRLl (e.g., signals ADDl and 
SUBTRACT 1 , not shown) indicating a phase relationship between 
signals P0 and P100. State machine 1 (110) receives control 
signals CTRLl and provides signals SELECTl that control 
multiplexers 105-108 to select the appropriately delayed 
clock signals P25, P50, P75, and P100. 

[0035] Signal P0 is also provided to match element 120, 
which provides feedback clock signal FBCLK to phase shift 
portion 140. Phase shift portion 140 includes a 256-tap 
delay chain 142 and a 256-to-l multiplexer 141 driven by the 
tap outputs from delay chain 142, a 128-tap delay chain 143 
and a 128-to-l multiplexer 144 driven by the tap outputs from 
delay chain 143, multiplexers 145-147, phase detector 2 

(148), and state machine 2 (149). 

[0036] Delay chain 142 and multiplexer 141 push all five 
signals P0, P25, P50, P75, and P100 forward to align signal 
FBCLK2 with signal CLKIN2 . Delay chain 142 is driven by 
signal CLKIN, and multiplexer 141 is controlled by signals 
SELECT2 from state machine 2 (149) . State machine 2 (149) 
sets the values of signals SELECT2 based on signals CTRL2 
from phase detector 2 (148), which compares the phases of 
signals CLKIN2 and FBCLK2 . 

[0037] The phase shift is added by offsetting either 
signal CLKIN2 or signal FBCLK2 using delay chain 143 and 
multiplexer 144. Thus, delay chain 143 is driven by either 
signal CLKIN or signal FBCLK, with the selection being 
determined by multiplexer 145. Multiplexer 145 can be 
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controlled, for example, by an externally-supplied input 
signal or a value stored in a memory cell (not shown) . The 
amount of phase delay to be inserted is determined by 
multiplexer 144, which can also be controlled, for example, 
by externally-supplied input signals or values stored in 
memory cells (not shown) . Multiplexers 146-147 are then 
controlled to determine whether a positive phase shift, a 
negative phase shift, or no phase shift is to be applied. 
Multiplexers 146-147 can also be controlled, for example, by 
externally-supplied input signals or by values stored in 
memory cells (not shown) . 

[0038] When no delay is added to either clock path (i.e., 
multiplexer 146 selects signal CLKIN to be provided as signal 
CLKIN2 and multiplexer 147 selects signal FBCLK to be 
provided as signal FBCLK2), no phase shift occurs. When 
delay is added to the FBCLK2 path (i.e., multiplexer 146 
selects signal CLKIN, multiplexer 145 selects signal FBCLK, 
and multiplexer 147 selects the output signal from 
multiplexer 144), a negative phase shift occurs. In other 
words, signal FBCLK is shifted earlier compared to signal 
CLKIN. When delay is added to the CLKIN2 path (i.e., 
multiplexer 147 selects signal FBCLK, multiplexer 145 selects 
signal CLKIN, and multiplexer 146 selects the output signal 
from multiplexer 144), a positive phase shift occurs. In 
other words, signal FBCLK is shifted later compared to signal 
CLKIN. 

[0039] A significant disadvantage of the circuit of Fig. 1 
is that implementing DLLs typically requires a large amount 
of circuitry, as can be seen from Fig. 1. The circuit can be 
reduced in size by using PLLs rather than DLLs. However, 
PLLs are very proces s -dependent , as described in the 
background section above, and digital solutions are often 
preferred. 

[0040] Fig. 2 illustrates a circuit requiring less 
circuitry than the circuit of Fig. 1, and also supporting a 
wider range of operating clock frequencies for the input 
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clock signal. However, DCC is not provided in this 
embodiment. The circuit of Fig. 2 is small enough, for 
example, to be a practical addition to user circuits 
implemented in programmable logic devices (PLDs) such as 
field programmable gate arrays (FPGAs ) and complex 
programmable logic devices (CPLDs) . The circuit of Fig. 2 
supports a phase shift of up to half of an input clock 
period. 

[0041] Fig. 2 is a schematic diagram of a phase shifter 
circuit according to one embodiment of the invention that 
includes three counter circuits 200 and 213-214, a divide-by- 
two register 210, a maximum value circuit (comparator 212 and 
multiplexer 211), an inverter 216, and an output clock 
generator (flip-flop 215) . The circuit of Fig. 2 operates as 
shown in Fig. 3. Thus, the combination of Figs. 2 and 3 
should be consulted in conjunction with the following 
explanation of the circuit of Fig. 2. 

[0042] Counter circuit 1 (200) uses a relatively faster 
clock signal (e.g., generated by an oscillator circuit 206) 
to count a number of counts P (QC[N:0]) in one period of 
input clock signal CLKIN. The number of counts P is divided 
by two and stored in register 210. Thus, register 210 stores 
the value P/2, which represents the number of counts in one- 
half of the input clock period. The value P/2 is stored as 
signals Q180[N:0]. In the pictured embodiment, the number of 
counts P is recounted during every fifth input clock cycle. 
(N is an integer. In some embodiments, N is eight, but N can 
have also other values, as desired.) 

[0043] To divide a multi-bit value by two, the least 
significant bit is dropped and the remaining bits are shifted 
one bit to the right, as is well known. Therefore, in some 
embodiments the number of bits stored in register 210 is one 
fewer than the number of bits in counter circuit 200. 
However, the number of bits is kept consistent through the 
drawings herein, for simplicity. 
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[0044] The maximum value circuit functions as follows to 
determine a delay value Q_D[N:0] that controls the magnitude 
of the phase shift. The maximum value circuit ensures that 
the delay value Q_D[N:0] does not exceed one-half of one 
input clock (CLKIN) period. Comparator 212 performs a 
comparison between a set delay S_D[N:0] and the maximum value 
(P/2, or Q180[N:0], in the pictured embodiment), and provides 
a high value on output signal BIGGER if the set delay exceeds 
the maximum value. Such comparators are well known in the 
relevant arts, and any appropriate implementation can be 
used. The set delay can be provided, for example, by an 
external bus, a value stored in on-chip memory, other 
circuitry included in the same integrated circuit, and so 
forth. Signal BIGGER controls multiplexer 211 to select the 
delay value Q_D[N:0]. If the maximum value is exceeded, the 
delay value Q_D[N:0] is set to the maximum value. Otherwise, 
the delay value Q_D[N:0] is set to the set delay S__D[N:0]. 
In the pictured embodiment, if the maximum value is exceeded 
the circuit also signals this condition by setting output 
signal OVER_LIMIT high. 

[0045] The maximum value circuit shown in Fig. 2 is purely 
exemplary. In some embodiments, the maximum value circuit is 
omitted, and the delay value Q_D[N:0] is the output of 
register 210 (e.g., signals Q180[N:0]). In some embodiments, 
the maximum value circuit is implemented in some other 
fashion. 

[0046] Note that divide-by- two register 210 serves to 
provide a maximum delay value of P/2 for the phase shifter 
circuit of Fig. 2. In some embodiments, the maximum delay 
has a value other than P/2. For example, if register 210 is 
replaced by a divide-by- four register, the maximum value is 
P/4. Other embodiments described herein illustrate examples 
of maximum values other than P/2 (see Figs. 8-14) . 
[0047] Returning now to Figs. 2 and 3, counter circuits 2 
and 3 (213 and 214, respectively) are clocked by similar 
faster clock signals (e.g., generated by oscillator circuits 
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having the same design and configuration as oscillator 
circuit 206) . Counter circuit 2 (213) starts counting when 
input clock signal CLKIN (ClkStart) goes high, and compares 
the count value with the delay value Q_D[N:0] (Qstop [N: 0] ) . 
When the value in counter circuit 2 reaches the delay value 
(at time T2 of Fig. 3), counter circuit 2 (213) provides a 
high output pulse on signal SETP. The high output pulse 
clocks a high value from power high VDD into the D terminal 
of flip-flop 215, driving signal CLKOUT high at time T2, as 
shown in Fig. 3. The high output pulse also resets counter 
circuit 2 . 

[0048] Counter circuit 3 (214) starts counting when input 
clock signal CLKIN goes low, because the clock start terminal 
(ClkStart) of counter circuit 3 is driven by signal CLKINB 
from inverter 216. Counter circuit 3 then compares the count 
value with the delay value Q_D[N:0] (Qstop [N: 0] ) . When the 
value in counter circuit 3 reaches the delay value Q_D[N:0] 
(at time T3 of Fig. 3), counter circuit 3 (214) provides a 
high output pulse on signal RSTP. The high output pulse 
resets flip-flop 215, driving signal CLKOUT low at time T3 , 
as shown in Fig. 3. The high output pulse also resets 
counter circuit 3 . 

[0049] Figs. 2A and 2B illustrate exemplary alternative 
implementations of the output clock generator of Fig. 2. 
Other implementations can also be used, as will be apparent 
to those of skill in the relevant arts. The circuit of Fig. 
2A includes a reset flip-flop 221 and a logical OR gate 222 
(which can be implemented, for example, as a NOR gate 
followed by an inverter) . The data input D of flip-flop 221 
is coupled to power high VDD, the clock input CK is coupled 
to receive signal SETP, and the output terminal Q is coupled 
to output clock terminal CLKOUT, as in Fig. 2. However, the 
reset terminal of flip-flop 221 is coupled to the output 
terminal of logical OR gate 222, which is driven by signals 
RSTP and RST. Thus, when the phase detector circuit of Fig. 
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2 is reset by a high value on signal RST, output clock signal 
CLKOUT is driven low. 

[0050] The circuit of Fig. 2B includes a set/reset flip- 
flop 231. The data input D of flip-flop 231 is coupled to 
power high VDD, the clock input CK is coupled to receive 
signal SETP, the reset terminal R is coupled to receive 
signal RSTP, and the output terminal Q is coupled to output 
clock terminal CLKOUT, as in Fig. 2. However, flip-flop 231 
also has a set terminal coupled to receive circuit reset 
signal RST. Thus, when the phase detector circuit of Fig. 2 
is reset by a high value on signal RST, output clock signal 
CLKOUT is driven high. 

[0051] In the pictured embodiment, counter circuit 1 (200) 
is designed to recount the length of the input clock pulse 
(i.e., to generate a new value of P) every fifth clock cycle. 
In other embodiments, the periodicity of the count has other 
values, e.g., the desired periodicity can be selected based 
on the stability of the input clock frequency. In other 
embodiments, other implementations of counter circuit 200 are 
used. Any appropriate embodiment can be used. 
[0052] Counter circuit 2 00 has as inputs input clock 
signal CLKIN and reset signal RST. Counter circuit 200 
provides a clock update signal CLK_UPDT and a counter output 
bus QC[N:0] representing the value P. In the pictured 
embodiment, counter circuit 200 includes a counter 207, which 
counts a number of counts in one period of the input clock 
signal CLKIN. 

[0053] In the pictured embodiment, counter circuit 200 
includes reset flip-flops 201-204, set flip-flop 205, 
oscillator circuit 206, and counter 207. Flip-flops 201-205 
are coupled in series, and serve to provide three non- 
overlapping clock pulses in the following repeating sequence: 
RST_QP, CLK_QP, and CLK_UPDT. Each of these signals is high 
for only one clock cycle. 

[0054] Initially, signal RST_QP is high, because flip-flop 
205 is a set flip-flop, and counter 207 is reset. The first 
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rising edge of signal CLKIN brings signal RST_QP low. On the 
second rising edge of signal CLKIN, signal CLK__QP provides a 
high value on oscillator enable signal EN to oscillator 
circuit 206. Thus, oscillator enable signal EN is high for 
one out of every five input clock cycles. Oscillator circuit 
206 generates a relatively fast oscillator output signal OUT 
(i.e., faster than input clock signal CLKIN) whenever signal 
EN is high. The oscillator output signal CLK_P is used by 
counter 207 to measure the input clock period. Thus, in the 
pictured embodiment counter 207 performs the counting process 
only during one input clock period out of each five input 
clock periods. On the fourth rising edge of signal CLKIN, 
signal CLK_UPDT goes high. On the fifth rising edge of 
signal CLKIN, signal RST_QP goes high again, resetting 
counter 2 07. The cycle then repeats each five clock cycles. 
[0055] Note that the number of flip-flops coupled in 
series in counter circuit 200 determines the frequency with 
which the length of the input clock period is determined. 
For example, in the embodiment of Fig. 2, the period is 
measured every five clock cycles. By adding another reset 
flip-flop to the chain (e.g., in front of set flip-flop 205), 
the period would be measured every six clock cycles, and so 
forth. It will be clear to one of skill in the relevant arts 
that this selection is a matter of design choice. 
[0056] Fig. 4 is a schematic diagram of one counter 
embodiment that can be used to implement counter circuit 213- 
214 of Fig. 2. This embodiment can be used, for example, 
when counter 200 is implemented as shown in Fig. 2. The 
counter circuit of Fig. 4 includes a reset flip-flop 401, an 
oscillator circuit 402, a counter 403, a comparator 404, a 
NOR gate 406, and an inverter 405. When signal ClkStart goes 
high, the power high VDD value is clocked into flip-flop 401, 
driving flip-flop output signal OSC_EN high and enabling 
oscillator 402. Oscillator 402 provides an output signal 
OSC_OUT having a frequency relatively much higher than that 
of input clock signal CLKIN. Counter 403 is clocked by 
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oscillator 402, and thus counts the oscillations on signal 
OSC_OUT. The value stored in counter 403 is passed to 
comparator 404, where it is compared to the value Qstop[N:0]. 
When the values match, signal Pulse goes high, resetting 
flip-flop 401 and counter 403 via NOR gate 406 and inverter 
405. Signal Reset can also reset flip-flop 401 and counter 
403 via NOR gate 406 and inverter 405. 
[0057] Any known oscillator circuit can be used to 
implement oscillator circuit 402 of Fig. 4 and oscillator 
circuit 206 of Fig. 2. Preferably, the same implementation 
is used for all three oscillator circuits in the phase 
shifter circuit, as this approach creates a high correlation 
among the three oscillators irregardless of external factors 
such as processing and temperature variations. For example, 
the well known ring oscillator design (e.g., a loop including 
an odd number of logic gates, e.g., two inverters and a NAND 
gate driven by the enable signal) can be used. This 
embodiment is particularly useful when the phase shifter 
circuit is implemented in a PLD, because the oscillator can 
be implemented using the programmable logic blocks of the 
PLD. In some embodiments, external oscillators are used. 
[0058] Any known counter or counters can be used to 
implement counter 403 of Fig. 4 and/or counter 207 of Fig. 2. 
For example, the well known ripple counters can be used. In 
some embodiments, double-edge flip-flops are used to double 
the count stored in the counter. In some embodiments, a 
first subset of the counter circuits use double-edge flip- 
flops with a first oscillator frequency, while a second 
subset of the counter circuits use single-edge flip-flops 
with a second oscillator frequency twice that of the first 
oscillator frequency. 

[0059] Any known comparator can be used to implement 
comparator 404 of Fig. 4. For example, the well known 
exclusive-NOR (XNOR) implementation can be used, wherein each 
pair of bits is provided to an XNOR gate, the XNOR gates are 
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combined using NAND gates, and the NAND gates each drive a 
NOR gate providing the Pulse output signal in Fig. 4. 
[0060] Fig. 5 is a schematic diagram of a phase shifter 
circuit according to another embodiment of the invention that 
provides 50 percent DCC. The embodiment of Fig. 5 is similar 
to that of Fig. 2, but counter circuit 3 (214) has different 
clock start and clock stop values, altering the position of 
the falling edge on the output clock signal CLKOUT. Note 
that the elements of Fig. 5 are similar to those of Fig. 2, 
and are similarly interconnected, except as is now described. 
Fig. 6 is a timing diagram for the phase shifter circuit of 
Fig. 5. 

[0061] As in the embodiment of Fig. 2, counter circuit 1 
(200) uses a relatively faster clock signal (e.g., generated 
by oscillator circuit 206) to count a number of counts P in 
one period of input clock signal CLKIN. The number of counts 
P is divided by two and stored in register 210 as signals 
Q180[N:0]. The number of counts P is recounted every fifth 
clock cycle. Counter circuits 2 and 3 (213 and 214) are 
clocked by similar faster clock signals (e.g., generated by 
oscillator circuits having the same design and configuration 
as oscillator circuit 206) . Counter circuit 2 (213) 
functions in the same fashion as in the embodiment of Fig. 2, 
driving output clock signal CLKOUT high at time T2 . 
[0062] Counter circuit 3 (214) starts counting when signal 
SETP goes high at time T2 . Counter circuit 3 then compares 
the count value with the P/2 value Q180[N:0] (Qstop [N: 0] ) . 
When the value in counter circuit 3 reaches the P/2 value 
Q180[N:0] (at time T4 of Fig. 6), counter circuit 3 (214) 
provides a high output pulse on signal RSTP. The high output 
pulse resets flip-flop 215, driving signal CLKOUT low at time 
T4, as shown in Fig. 6. The high output pulse also resets 
counter circuit 3 . Note that the high pulse on the output 
clock signal CLKOUT has a duration of one-half of the input 
clock period, i.e., the output clock signal CLKOUT has a 50 
percent duty cycle. 
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[0063] The embodiment of Fig. 7 optionally provides or 
does not provide DCC, as controlled by a DCC enable signal 
DCCEN . The embodiment of Fig. 7 is similar to those of Figs. 
2 and 5, but provides a selection of input values for counter 
circuit 3 (214), thereby providing control over the position 
of the falling edge on the output clock signal CLKOUT. Only 
the differences from the earlier figures are described. 
[0064] When DCC is enabled (DCCEN is high) , multiplexer 
722 selects signal SETP to provide to the clock start 
terminal (ClkStart) of counter circuit 3 (214), and 
multiplexer 721 selects the P/2 value Q180[N:0] to provide to 
the clock stop terminal (Qstop(N:0]) of counter circuit 3 
(214) . Thus, the circuit of Fig. 7 behaves like the circuit 
of Fig. 5, and output clock signal CLKOUT has a 50 percent 
duty cycle. 

[0065] When DCC is disabled (DCCEN is low) , multiplexer 
722 selects signal CLKINB (provided by inverter 723) to 
provide to the clock start terminal (ClkStart) of counter 
circuit 3 (214) , and multiplexer 721 selects the delay value 
Q_D[N:0] to provide to the clock stop terminal (Qstop(N:0]) 
of counter circuit 3 (214) . Thus, the circuit of Fig. 7 
behaves like the circuit of Fig. 2, and output clock signal 
CLKOUT has the same duty cycle as input clock signal CLKIN. 
[0066] As previously described, in the embodiment of Fig. 
2 divide-by- two register 210 serves to provide a maximum 
delay value of P/2 for the illustrated phase shifter circuit. 
In the embodiment of Figs. 5 and 7, divide-by- two register 
210 provides a maximum delay value of P/2, and also serves to 
provide a DCC value of P/2 for the output clock signal (i.e., 
the output clock signal has a 50 percent duty cycle) . Fig. 8 
illustrates a variation of the circuit of Fig. 7 in which the 
maximum value and the DCC value are both P/4. When the DCC 
value is P/4, the output clock signal has a 25 percent duty 
cycle. 

[0067] Fig. 8 is similar to Fig. 7, except that divide-by- : 
two register 210 is replaced by divide-by- four register 810. 
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Register 810 divides the count value QC[N:0] by four, e.g., 
by dropping the two least significant bits and shifting the 
remaining bits two bits to the right. Therefore, in some 
embodiments the number of bits stored in register 810 is two 
fewer than the number of bits in counter circuit 200. 
[0068] The output signals Q90[N:0] stored in divide-by- 
four register 810 represent P/4. Signals Q90[N:0] are used 
by the maximum value circuit to determine the delay value 
Q_JD[N:0], and by multiplexer 721 as the clock stop value for 
counter circuit 3 (214) when DCC is enabled. Thus, signals 
Q90[N:0] represent both the maximum value and the DCC value. 
[0069] If a similar variation is applied to the circuit of 
Fig. 2 (i.e., replacing divide-by-two register 210 by a 
divide-by- four register) , the resulting circuit does not 
provide DCC and has a maximum delay value of P/4. If a 
similar variation is applied to the circuit of Fig. 5, the 
resulting circuit always provides DCC and has both a maximum 
value and a DCC value of P/4. 

[0070] Fig. 9 is similar to Fig. 2, except that divide-by- 
two register 210 of Fig. 2 is replaced by adder 921 and 
register 922. Adder 921 adds together two values, P/2 (GND, 
QC[N:1]) and P/4 (GND, GND, QC[N:2]). Any appropriate adder 
design can be used to implement adder 921, e.g., a ripple 
adder or a bypass adder. The resulting sum O_270[N:0] is 
stored in register 922. The stored value Q270[N:0] 
represents the number of counts in three- fourths of one input 
clock cycle. Thus, signals Q270[N:0] represent a maximum 
delay value of 3P/4 for the circuit of Fig. 9. 
[0071] Fig. 10 is similar to Fig. 5, except that in 
addition to divide-by- two register 210, an adder 921 and 
register 922 are included. Adder 921 adds together two 
values, P/2 (GND, QC[N:1]) and P/4 (GND, GND, QC[N:2]). The 
resulting sum O_270[N:0] is stored in register 922. The 
stored value Q270[N:0] represents the number of counts in 
three-fourths of one input clock cycle. Thus, signals 
Q270[N:0] represent a maximum delay value of 3P/4 for the 
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circuit of Fig. 10. The circuit of Fig. 10, like the circuit 
of Fig. 5, provides an output signal CLKOUT with a 50 percent 
duty cycle. The DCC value of Q180[N:0] (P/2) is provided by 
register 210. 

[0072] Fig. 11 is similar to Fig. 7, except that in 
addition to divide-by-two register 210, an adder 921 and 
register 922 are included. Adder 921 adds together two 
values, P/2 (GND, QC[N:1]) and P/4 (GND, GND, QC[N:2]). The 
resulting sum O_270[N:0] is stored in register 922. The 
stored value Q270[N:0] represents the number of counts in 
three-fourths of one input clock cycle. Thus, signals 
Q270[N:0] represent a maximum delay value of 3P/4 for the 
circuit of Fig. 11. The circuit of Fig. 11, like the circuit 
of Fig. 7, optionally provides or does not provide 50 percent 
DCC for the output clock signal CLKOUT. The optional DCC 
value of Q180[N:0] (P/2) is provided by register 210. 
[0073] Fig. 12 is similar to Fig. 2, except that divide- 
by- two register 210 is replaced by a decrementer 1031 and 
register 922. Decrementer 1031 subtracts a predetermined 
value (e.g., one or two) from the input signals QC[N:0]. Any 
appropriate decrementer design can be used to implement 
decrementer 1031, e.g., a ripple subtractor with one set of 
inputs tied to 0...01. The resulting value OUT[N:0] is 
stored in register 922. The stored value QSUB[N:0] 
represents the number of counts in one input clock cycle 
minus the predetermined subtracted value. Thus, signals 
QSUB[N:0] represent a maximum delay value of P-s, where s is 
the predetermined subtracted value. 

[0074] The embodiment of Fig. 12 can be useful, for 
example, when the largest possible phase delay is desired. 
If the set delay is actually equal to P, the circuit might 
not function properly. Therefore, "backing off" the maximum 
delay by one or two counts (or by some other number of 
counts) from the value P can ensure that the circuit 
functions properly, while allowing the largest reliable value 
for the phase delay. 
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[0075] Fig. 13 is similar to Fig. 5, except that in 
addition to divide-by- two register 210, a decrementer 1031 
and register 922 are included. Decrementer 1031 subtracts a 
predetermined value (e.g., one or two) from the input signals 
QC[N:0]. The resulting value OUT[N:0] is stored in register 
922. The stored value QSUB[N:0] represents the number of 
counts in one input clock cycle minus the predetermined 
subtracted value. Thus, signals QSUB[N:0] represent a 
maximum delay value of P-s, where s is the predetermined 
subtracted value. The circuit of Fig. 13, like the circuit 
of Fig. 5, provides an output signal CLKOUT with a 50 percent 
duty cycle. The DCC value of Q180[N:0] (P/2) is provided by 
register 210. 

[0076] Fig. 14 is similar to Fig. 7, except that in 
addition to divide-by- two register 210, a decrementer 1031 
and register 922 are included. Decrementer 1031 subtracts a 
predetermined value (e.g., one or two) from the input signals 
QC[N:0]. The resulting value OUT[N:0] is stored in register 
922. The stored value QSUB[N:0] represents the number of 
counts in one input clock cycle minus the predetermined 
subtracted value. Thus, signals QSUB[N:0] represent a 
maximum delay value of P-s, where s is the predetermined 
subtracted value. The circuit of Fig. 14, like the circuit 
of Fig. 7, optionally provides or does not provide 50 percent 
DCC for the output clock signal CLKOUT. The optional DCC 
value of Q180[N:0] (P/2) is provided by register 210. 

[0077] Fig. 15 illustrates the steps of a first novel 
method of providing a phased output clock signal from an 
input clock signal, according to another embodiment of the 
invention. In some embodiments, the steps of Fig. 15 are 
performed by a circuit implemented in a programmable logic 
device (PLD) , for example in a field programmable gate array 

(FPGA) or a complex programmable logic device (CPLD) . 

[0078] In step 1501, a first number of counts between 
successive initial edges of an input clock signal is counted. 
For example, in the embodiment of Fig. 2, counter 1 (200) 
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counts a number P of oscillator clock cycles within a single 
period of input clock signal CLKIN . In some embodiments, the 
initial edges are rising edges. 

[0079] In step 1502, a delay value is determined, based at 
least in part on the first number. The delay value can be, 
for example, a set delay provided by a PLD user. In some 
embodiments, the set delay value is compared with a maximum 
value. If the set delay does not exceed the maximum value, 
the set delay value is selected as the delay value. If the 
set delay exceeds the maximum value, the maximum value is 
selected as the delay value. 

[0080] In some embodiments, the maximum value is 
determined by dividing the first number by two, by four, or 
by another number. In some embodiments, the maximum value is 
determined by adding together one-half of the first number 
and one-quarter of the first number. In some embodiments, 
the maximum value is determined by subtracting a 
predetermined value from the first number. 

[0081] In step 1503, a second number of counts is counted 
following each initial edge of the input clock signal, and 
the second number is compared with the delay value. 
[0082] In step 1504, a first edge is provided on an output 
clock signal when the second number reaches the delay value. 
In some embodiments, the first edges are rising edges. 
[0083] In step 1505, a third number of counts is counted 
following each following edge of the input clock signal, and 
the third number is compared with the delay value. In some 
embodiments (i.e., where the initial edges are rising edges), 
the following edges are falling edges. 

[0084] In step 1506, a second edge is provided on the 
output clock signal when the third number reaches the delay 
value. In some embodiments (i.e., where the first edges are 
rising edges), the second edges are falling edges. 

[0085] In some embodiments, counting the first number of 
counts is repeated every M periods of the input clock signal, 
where M is an integer. In some embodiments, M is five. 
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[0086] Fig. 16 illustrates the steps of a second novel 
method of providing a phased output clock signal from an 
input clock signal, according to another embodiment of the 
invention. In some embodiments, the steps of Fig. 16 are 
performed by a circuit implemented in a programmable logic 
device (PLD) , for example in a field programmable gate array 
(FPGA) or a complex programmable logic device (CPLD) . 
[0087] In step 1601, a first number of counts between 
successive initial edges of an input clock signal is counted. 
For example, in the embodiment of Fig. 5, counter 1 (200) 
counts a number P of oscillator clock cycles within a single 
period of input clock signal CLKIN. In some embodiments, the 
initial edges are rising edges. 

[0088] In step 1602, a DCC value is determined, based at 
least in part on the first number. The DCC value can be 
determined, for example, by dividing the first number by two, 
four, or another number. 

[0089] In step 1603, a delay value is determined. The 
delay value can be, for example, a set delay provided by a 
PLD user. In some embodiments, the set delay value is 
compared with a maximum value. If the set delay does not 
exceed the maximum value, the set delay value is selected as 
the delay value. If the set delay exceeds the maximum value, 
the maximum value is selected as the delay value. 
[0090] In some embodiments, the maximum value is 
determined by dividing the first number by two, by four, or 
by another number. In some embodiments, the maximum value is 
determined by adding together one-half of the first number 
and one-quarter of the first number. In some embodiments, 
the maximum value is determined by subtracting a 
predetermined value from the first number. In some 
embodiments, the maximum value is independent of the first 
number . 

[0091] In step 1604, a second number of counts is counted 
following each initial edge of the input clock signal, and 
the second number is compared with the delay value. 
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[0092] In step 1605, a first edge is provided on an output 
clock signal when the second number reaches the delay value. 
In some embodiments, the first edges are rising edges. 
[0093] In step 1606, a third number of counts is counted 
following each first edge on the output clock signal, and the 
third number is compared with the DCC value. 
[0094] In step 1607, a second edge is provided on the 
output clock signal when the third number reaches the DCC 
value. In some embodiments (i.e., where the first edges are 
rising edges), the second edges are falling edges. 
[0095] In some embodiments, counting the first number of 
counts is repeated every M periods of the input clock signal,: 
where M is an integer. In some embodiments, M is five. 
[0096] Those having skill in the relevant arts of the 
invention will now perceive various modifications and 
additions that can be made as a result of the disclosure 
herein. For example, the above text describes the circuits 
and methods of the invention in the context of ICs such as 
programmable logic devices (PLDs) . However, the circuits of 
the invention can also be implemented in other electronic 
systems, for example, in non-programmable integrated 
circuits, or in printed circuit boards including discrete 
devices . 

[0097] Further, inverters, logical OR gates, NOR gates, 
XNOR gates, NAND gates, flip-flops, counter circuits, maximum 
value circuits, counters, oscillators, registers, output 
clock generators, dividers, adders, decrementers , 
multiplexers, comparators, and other components other than 
those described herein can be used to implement the 
invention. Active-high signals can be replaced with active- 
low signals by making straightforward alterations to the 
circuitry, such as are well known in the art of circuit 
design. Logical circuits can be replaced by their logical 
equivalents by appropriately inverting input and output 
signals, as is also well known. 
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[0098] Moreover, some components are shown directly 
connected to one another while others are shown connected via 
intermediate components. In each instance the method of 
interconnection establishes some desired electrical 
communication between two or more circuit nodes. Such 
communication can often be accomplished using a number of 
circuit configurations, as will be understood by those of 
skill in the art. 

[0099] Accordingly, all such modifications and additions 
are deemed to be within the scope of the invention, which is 
to be limited only by the appended claims and their 
equivalents . 
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